Ph.D. Research Proposal Information Extraction for Legal Documents
نویسنده
چکیده
Information Extraction technique, which is a process of converting unstructured text into structured data, has been used in a variety range of applications such as traffic monitoring systems, biomedical systems and more general open knowledge bases for open language learning. However, the use of information extraction in legal area is still quite limited and suffers difficulties. For example, the existing system AustLII, which is developed and maintained by University of Technology Sydney and University of New South Wales, uses traditional information retrieval approach to organise all the legal documents and the system only supports keywords search. Another example, the LexML Web project in Brazil also applies similar approach. Although more sophisticated techniques such as case based reasoning, semantic network and ontology based approaches are introduced, not many of them are actually used in an integrated application. Hence, the aim of this study would be to come up with new design that applies information extraction techniques in order to construct a more accurate legal text management system. The new system is designed to manage laws as well as legal cases for both legal study and use. Comparing with AustLII, the new system will also focus on Australia laws but provide more accurate search results because of the use of information extraction methods. Australasian Legal Information Institute
منابع مشابه
Automatic Legal Document Analysis: Improving the Results of Information Extraction Processes using an Ontology
Information Extraction (IE) is a pervasive task in the industry that allows to obtain automatically structured data from documents in natural language. Current software systems focused on this activity are able to extract a large percentage of the required information, but they do not usually focus on the quality of the extracted data. In this paper we present an approach focused on validating ...
متن کاملUsing Linguistic Information and Machine Learning Techniques to Identify Entities from Juridical Documents
Information extraction from legal documents is an important and open problem. A mixed approach, using linguistic information and machine learning techniques, is described in this paper. In this approach, top-level legal concepts are identified and used for document classification using Support Vector Machines. Named entities, such as, locations, organizations, dates, and document references, ar...
متن کاملTemporal information extraction from legal documents
The aim of this paper is to analyze what kinds of temporal information can be found in different types of legal documents. In particular, it provides a comparison of different legal document types (case law, statute or transactional document) and how one can do further reasoning with the extracted temporal information.
متن کاملBasic Student Rights within the Iranian Legal System
Having a clear set of basic Iranian students’ legal rights could serve as a practical guideline to commitments and responsibilities of teachers, administrators, and others dealing with education. To this end, all existing legal documents related to the topic were content analyzed. The result has been a set of fourteen principle areas which can be categorized into three subsets depending on the ...
متن کاملOverview of the FIRE 2017 IRLeD Track: Information Retrieval from Legal Documents
The FIRE 2017 IRLeD Track focused on creating a framework for evaluating different methods of Information Retrieval from legal documents. Therewere two tasks for this track: (i) Catchphrase Extraction task, and (ii) Precedence Retrieval task. In the catchphrase extraction task, the participants had to extract catchphrases (legal keywords) from Indian Supreme Court case documents. In the second ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017